See what's new in data management.
Importing/exporting data
JDBC support
- Import data from Oracle, Microsoft SQL Server, MySQL, Amazon Redshift,
Snowflake, and other databases
- Export data to an existing database table
- Execute SQL statements on a database
- Create data source names to store connection settings
- Support for CLOBs, BLOBs, and Unicode
- Import data using the GUI
ODBC support
- Import data from any ODBC data source, such as Oracle, SQL Server, Access,
Excel, MySQL, and DB2
- Export data to new or existing ODBC tables
- Execute custom SQL commands individually or in batches
- Customize ODBC connection strings
- Support for VARCHARs/CLOBs and BLOBs
- Support for Unicode
Built-in spreadsheet editor
- Clipboard Preview Tool lets you control how data will be pasted
- Manage variables with the Variables Tool
- For Windows, Mac, and Unix
- Pinnable rows and columns
- Resizable cell editor for string data
- Tool tips for truncated text
- Proportional width font support
- Columns can be resized and are preserved when saving the dataset
- Show variable labels in column header
- Keyboard shortcut for hiding and showing value labels
Properties window
- Manage variables
- Manage dataset properties
- For Windows, Mac, and Unix
Variables Manager
- Change storage types, names, and formats
- Add and edit value labels
- Attach notes to variables
- Filter variables
- For Windows, Mac, and Unix
Functions
- Statistical functions
- Mathematical functions
- Trigonometric functions
- String functions
- Unicode functions
- Regular expressions
- Date and time functions
- Time-series functions
- Random-number functions
  - 18 functions
  - Stream random numbers
- Matrix functions
- Programming functions
Data reorganization
- Row–column transposition
- Data reshaping (Updated)
- Stacking of variables
- Collapsing into means, totals, etc.
Unicode support
- UTF-8
- Translation of extended ASCII to UTF-8
- Unicode-aware string functions
- Locale-based sorting and string comparison
Labels
- Dataset labels
- Variable labels
- Value labels (e.g., male and female for 0 and 1)
- Ability to switch between multiple sets of dataset, variable, and value labels
- Missing-value labels
- Support for multiple languages, including Unicode
Notes
- Extensive notes can be attached to a dataset
Data snapshots
- Allow multiple levels of undo for modified datasets
Multiple datasets in memory (frames)
- Link frames
- Copy data between frames
- Copy labels between frames (New)
- Access data in other frames
- Post simulation results to a frame
- Manipulate frames
- Access frames from Mata
- Save, load, and describe multiple frames (Updated)
Automatic memory management
Sorting
- Ascending or descending sorts
- Multiple-key sorts
- Numeric and string sorts
- Locale-aware Unicode string sorting and comparison
Combining datasets
- Merge datasets
  - By key variables
  - By observations
- Join datasets
  - Outer join
- Append datasets
- Append time series
Special datasets
Utilities
- Count number of observations that satisfy specified conditions
- Formatted and unformatted disk I/O
- Zip-file support
- Unicode conversion from/to extended ASCII
- Custom filters to manipulate text files
Variable management
- Generation of new variables
- Replacement of existing variables
- Renaming variables
- Encoding and decoding string variables
- Reordering variables in dataset
- Variables Manager
Dataset utilities
- Flexible description of variables, labels, and types
- List values of variables (Updated)
- Data signatures to verify the integrity of datasets
- Codebooks for variables
- Value-label reports
- Tables of duplicates and missing values
- Compress (make dataset as small as possible without loss of accuracy)
Variable types
- Numeric storage types
  - Byte
  - Integer (int)
  - Long
  - Float
  - Double
- String (including Unicode, very long strings, and BLOBs)
- Dates and times
- Business calendars
Long string support
- Strings up to 2 billion characters long
- Coalescing of duplicate values to save memory
- Binary 'strings' (BLOBs)
- Import and export entire files into long strings/BLOBs
- Unicode (UTF-8) strings
Stored results
- Save results to disk for later use
- Store estimation results in memory
- Create tables to compare results
- Create custom tables
Additional resources
See New in Stata 19 to learn about what was added in Stata 19.